Electromagnetism and Quantum 

Theory 

Luis Fernando Alday* 

Parti: Electromagnetism 

This half of the course is a short introduction to the mathematical description of the salient 
points of the theory of electromagnetism. I will assume that you have some familiarity 
with the physical phenomena of electromagnetism which the theory aims to encompass and 
account for. For many of you this will have come from physics courses in school, and can 
be reinforced by consulting a book like Duffin (see below). Our main concern is with the 
formalism: The language of vector operators and the integral theorems of Gauss, Green 
and Stokes are precisely tailored to handle electromagnetic theory. Once one has the theory 
written that way, there are various tantalising hints of how to carry it forward into the 
four-dimensional space-time formalism of special relativity - but that is another course. 

Synopsis 

Maxwell's equations in vacuum with sources. Lorentz force law. Plane waves and polariza- 
tion. Electrostatics and Magneto statics. Energy density and the Poynting vector. Scalar and 
vector potentials. Gauge invariance. Maxwell's equations in the Lorentz gauge. The wave 
equation for potentials. 

Reading list 

One good source, with extensive discussion of the experimental background, which I shall of 
necessity by omitting is: 

W.J. Duffin, Electricity and Magnetism, McGraw-Hill (2001), Fourth Edition, Chapters 1- 
4,7,8,13. 

Classic texts on electromagnetism: 

R. Feynman, Lectures in Physics, Vol.2. Electromagnetism, Addison Wesley. 
J.D. Jackson, Classical Electrodynamics, John Wiley. 



* Largely based on notes by Paul Tod. 



1 Time independent Electromagnetic Theory 



1.1 Electrostatics 



This is the subject of static electricity, which we've all played with: recall rubbing a pen on 
your cat and picking up bits of paper. We interpret this as charging up or adding charge to 
the pen. As greeks didn't have pens, they used amber. The greek word for amber is electron 
and this is where the subject gets its name from. 

We will assume the following facts about charge: 

• there exist charges in nature; 

• charge can be positive or negative; 

• charge cannot be created or destroyed; 



For a more sophisticated standpoint, we may know that charge is discrete, and that electrons 
have charge -1 and protons charge +1 in suitable units, but for our purposes we want to 
think of charge as continuous. As a mathematical idealization, we represent charge by a real 
function, the charge density p(x,y, z,t), with the property that the total charge in a region 
V of space is 



Q = J pdV = J J J P dxdydz. (1) 

Other idealizations are possible, and we shall sometimes use them. For example, it may be 
convenient to think of charges residing on a surface, something two-dimensional, and then 
charge surface density o~(x,y,t) is charge per unit area, so that the charge on a piece S of 
surface is 



Q = / odS- (2) 



s 

similarly, for a one-dimensional distribution of charge, charge line density is charge per unit 
length on some curve. Finally, a point charge corresponds to a finite, nonzero charge localized 
at a point. 

How do charges interact with each other? An important experimental fact about point 
charges is the 

Coulomb's Law: Given two point particles P, P' with charges Q, Q' at positions r, r 1 , the 
electric force on P due to P' is 

r — r 1 

F = - — QQ Z — ^3 (3) 
47re \r — r \ 6 

where eo is a constant and accounts for the correct units. 
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We treat this law as an experimental fact. To see how one might have discovered it, note 
that: 

• it is an inverse-square law, acting along the line joining the particles; 

• it is proportional to the product of the charges, so that like charges repel and unlike 
charges attract. 

It is convenient to think of this law as follows: P' generates an electric field to which P 
responds. 

Put P' at the origin and define the electric field at r to be 

E(r) = ~r~Q' ~^ (4) 
47re r 3 

Then P, which has charge Q, when placed in this field is subject to (or "feels") the force 

F = QE (5) 

Replace P' by various particles Pi, Pi, P n , at points fi, r 2 , ■■■,f n with charges Qi, Q2, Q n - 
The forces will add as vectors so if we defined the electric field now as 

m = ^T Q fz r 2\ (6) 

47re ^— ' r - rA 3 
i=i ' 11 

then, as before, P placed at f is subject to the force 

F{f) = QE(f) (7) 
We want to explore (6). Note first that it is the gradient of another function: 



E=-V®, $ = 

4ire ^—f \r - n 
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This function $ is called the electric potential. In particular, therefore 

VA£ = 0, (9) 

so the force (7) is a conservative force in the language of mechanics (familiar to you from 
Mods). We also have 



V • E = V 2< 3> = 0, except at f — 



(10) 



Consider the diagram 



large sphere S 




i.e. S is a large sphere containing all Pi, Si is a small sphere containing only Pj and V is the 
region in between. 

Then V • E = on V\ so 



[ E-dS-J2 f E ■ dS = I 'y ■ EdV = 

J S i=l V 



(ii) 



the first two are surface integrals (with dS pointing away from the center of the corresponding 
sphere), while the last one is a volume integral. Make sure you understand the signs in this 
equation. Let us focus in the integral over Si, taking ri = for simplicity we obtain: 



E-dS 



Si 



47ren 



Qi—z ■ n + terms integrating to zero ) dS 



4:7760 
1 



Qi 



-r 2 sin 



1 



in the second step we used n = -. Each term in the sum (11) can be calculated like this, so 
we obtain: 



I E-dS = Y] [ E-dS = — (Qi + Q 2 + ... + Q n ) = — x total charge inside S (12) 

This is the Gauss 's Law in its integral form: 

Gauss's Law: The flux of E out of V = — x total charge in V. 

If we have a smoothed out charge density instead of point charges, Gauss's law would read 

E ■ dS = — [ pdV, (13) 



>s e o Jv 

where V is the volume inside the closed surface S. We will assume this to be true. 



Now, by Divergence theorem we have 



but if this is to hold for all possible regions V, then 



V ■ E - —p ) dV = (14) 



V ■ E = -p, (15) 

which is the differential version of Gauss's Law. 

This together with E = — V$ implies Poisson's equation 

V 2 $ = --p (16) 
What is the solution for this equation? remember that for point particles we had (see (8)) 

$ = ^EAf ( 17 ) 

47re ^— ' r - rA 
i=i ' 11 



so we might guess that the solution of (16) is 



•v = 4kllLw=kw (18) 



— H?T ai "'' 

-0 J J Jv 

where V" is the region in which p ^ 0. In fact, this is the correct guess. 



An example 

The main problem of Electrostatics is to obtain E given the charge distribution. We have 
solved that problem with (18): Given the density p, integrate this to find $ and then 
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E = — V$. However, in simple cases we can go straight to E by using (4). Here is an 
example: 

Total charge Q is spread out uniformly round a plane circular wire of radius a. Find the 
electric field at a point P on the axis of the circle, at a distance b from the centre. 




unit vectors 



We can do this one directly from the inverse square law: E 

Cut the circular wire into elements, each of legth ad9; then each contains charge equal to 
Qd6 and so contributes ^dO^-j^, (with e as in the figure) to E at P. Adding these up 

around the circle leads, by symmetry, to a vector along k. Note, from the diagram, that 
e-k = cos a, so that E(p) = Ek, with E a scalar and given by 



E 



Q 1 cos a 

— • do ■ • — — 

27r 47re L 2 

Q cos a 



47re L 2 

Q b 
47re (a 2 + b 2 fl 2 

which is the answer. 

It is trickier to find E for a point P' not on the axis. 
$ and the Potential Energy 

Let us think about Newton's equations of motion for a charged particle of mass m and charge 
Q subject to the force (7) 



ma = F = QE = -QV$ 



(19) 



Then 



d 

dt V 2 



-mv ■ v 



mv ■ a 



i=i 



9$ dx i 
dx l dt 



-Q 



d$ 

dt 



(20) 
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so that 



E = -m\v\ 2 + Q<& = constant 



(21) 



In the motion of the charged particle, the kinetic energy changes with time, but the sum 
of the kinetic energy and Q§ is constant; thus Q<3> is the potential energy of the particle of 
charge Q in the electric field with potential $. 1 

1.2 Magnet ost at ics 

Frequently, people find magnetism a little more mysterious than electricity. Rather than 
thinking of fridge magnets, you should think of electromagnets: charges in motion give rise 
to electric currents and these produce magnetic fields; many of you will have visualized 
these fields in experiments by sprinkling iron filings on sheets of cardboard transverse to a 
current-carrying wire. (The fridge magnet, or any permanent magnet, derives its magnetism 
from microscopic currents.) The subject corresponding to electrostatics, which is produced 
by time-independent charges, is magnetostatics, which is produced by time-independent, or 
"steady", currents. 



Given a collection of point charges Qi in motion with velocities Vi, we define the corresponding 
electric currents as: 



This is a vector field made by adding up elementary contributions. By analogy with charge 
density p as a smoothed-out distribution of point charges, we introduce a vector field, the 



In this section we shall usually be thinking of steady currents, but there is one thing to deal 
with first, namely the mathematical expression for the physical observation than charge is 
conserved. 

Conservation of charge 

Consider a region V with surface S with charges moving through. 

1 Note that this is completely analogue to the usual (gravitational) potential energy, the mass m is replaced 
by the charge Q and the potential gh is replaced by the potential <£>. 
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The total charge inside V is Q = J v pdV, so that 



dQ 
dt 



f 3 P 

I ~^-dV allowing 7^ for the moment 
Jv 

rate of increase of Q 

rate charge goes in — rate charge goes out 



J-dS 

<s 



V-JdV 



an so 

dp 

this is to be true for all regions V, then 



by divergence theorem 



V • J \dV = 



dp 
dt 



+ V- J = 



(22) 



(23) 



which is the charge conservation equation. 

For the rest of this section, we suppose none of the quantities of interest depends on time. 
For this time independent situation (23) reduces to V ■ J = 0. 

Magnetic Field Strength 

Like the electric field E, the magnetic field strength B can be defined from an experimentally 
established force law. Analogously to (7), a charged particle with charge Q, moving with 
velocity v in a magnetic field B is subject to a force 



F = QvA B 



(24) 
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This serves to define B, but note a peculiar feature of this force law, that the force is 
orthogonal to the velocity. In particular, therefore v ■ F = and the force does no work. 

Forces between two parallel current-carrying wires 



Ii= Iik 



lines of force 



I 2 = I 2 k 



To see (24) in action, let us consider the force between two parallel current-carrying wires. 

What can we say about the magnetic field B generated by a single wire? let us choose coordi- 
nates such that the wire is along the z direction and let us use cylindrical-polar coordinates. 
It can be experimentally verified that if we take a charge Q and move it around the wire, the 
charge is subject to a force. This force vanishes, i.e. there is no force, if the charge moves 
along the 9 direction, while there is a force if the charge moves in any other direction. This 
means that B has a component only in the 9 direction 2 ; using the symmetry in 9 and z we 
can write B = B(R)eg, where R is the distance to the wire. 

Now suppose we have parallel wires, wire 1 carrying current I\ and wire 2 carrying current 
I2. Each wire gives rise to a B- field and the field from one exerts a force on the current in 
the other. Using (24), and thinking of the currents as charged particles in motion, we have: 

Force per unit length on wire 2 due to wire 1 is 

F12 = I2 A B\ . 

2 This is consistent with the following fact: if you sprinkle iron filings on a cardboard held at right angle 
to a current-carrying wire, you can see that the magnetic lines of force are concentric circles, centered on the 
wire. 
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where B\ means B at wire 2 due to current in wire 1, and from the geometry I 2 = I 2 k. 
Similarly, the force on wire 1 due to wire 2 is 



F21 = I\ A B 2 . 

— * — * — * 

where B 2 is B at wire 1 due to wire 2, and Ii = I\k. These forces must be equal in magnitude 
and opposite, due to Newton's third law: 

f 2 a B x = -h A B 2 . 

From what was said about the direction of B, we have that at wire 2, B\ = jB\ in terms of 
some magnitude £>i, and in wire 1, B 2 = —jB 2 in terms of some magnitude B 2 . Therefore 

I-P12I = hB\ = \F 21 \ = l\B 2 . 
It is an experimental fact that this magnitude is 

\F\ = p-hh 

in terms of a constant /^o (which, as with eo is needed to get the right dimensions). 
From this experimental fact we deduce that, at wire 2, 

B = j 

2ira 

At a general point, in terms of cylindrical polar coordinates (R, 6, z) with the wire along the 
z-axis this is 

Vol 1 

~^R ee 

which we can write in cartesian coordinates as 

Vol f_V_ x n \ d2 _ ji , ..2 



B = %\-wv>»h R=x+V (25) 

This is the magnetic field due to an infinite straight wire with constant current, which can 
be though of as the elementary magnetic field, much as (4), E — ^ gives the elementary 
electric field. 

From (25), for R 7^ 0, we calculate 



V • B = 0, VA5 = (26) 

Our aim for the next few pages is to calculate the B due to current in an arbitrary wire, 
much as the integral (18) together with the relation E = — V$ gives E for an arbitrary 
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charge distribution. We achieve this aim with (32) below. As a step towards this aim, we 
note that, when T is a horizontal circle of radius R centered on a straight wire 



Vol 



-eg ■ eeRd9 



2ir J R 
= Vol 

Even though we have derived this for a straight wire, the result is much more general: 
Ampere's Law: § T B ■ di — /iqX total current through T. 



(27) 
(28) 



/ 



Compare ampere's law to Gauss Law in the first form we had. 

For a current density instead of a wire, we obtain an integral version of Ampere's Law: 

spans 1 

V A B ■ dS by Stoke's theorem 
and for this to be true for all E we obtain the differential version of Ampere's law: 

V A B = VoJ (29) 

(recall that V ■ J = from charge conservation and time independence, and we need this for 
(29) to make sense). 

By looking at (26) we have V • B = 0, which is to be contrasted with (15): V • E — —p. 
Thus we interpret (26) as saying "there are no magnetic charges". 

The magnetic potential 
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It follows from (9) that, in a simply-connected region, there exists a function $ such that 
E = — V$. It is a less familiar fact that 

Claim: if V ■ B = in a suitable region, then there exists in that region a vector field A 
such that B = V A A. 

You will prove this on problem sheet 2. A is the magnetic potential (also called vector 
potential, in which case $ is called the scalar potential). 

Note that a change A — > A + VC, for any scalar function (, leaves B unchanged. We may 
exploit this freedom to impose another condition, namely 

V-A = 

for suppose F = V • A ^ and change A — > A + V • £, then this would change F — > F + V 2 C; 
now, choose ( such that V 2 C = — F, so that now V ■ A = 0. 

Equation (29) can be turned into an equation for A as follows: 



VAB = VA(VAA) = V(V-A)- V 2 A 

= Vo? (30) 
with the choice V • A = 0, then we have 

V 2 A = -^ Q J (31) 

Remember that this is true for the particular choice V ■ A = 0. This has the vector form of 
the Poisson's equation, and we've solved Poisson's equation before: recall (18) and compare 
with (16). So we solve (31) by 



This is for a volume distribution of current. If we have just a line current J = It, where I is 
constant and t is the tangent (of unit length) to a curve L (where the curve L is the "wire"), 
then instead 

-> u,qI f tdi 
A 



4tt J l |r-r%0| 



12 



I 



where i parametrizes where we are along the wire. 
From A we calculate B as 



B 



vaI=^ f va 

4?r J l 
An It r — r r 



or 



B 



fi I f dt A (f - f") 



47T 



i^t^ 3 



(11 



(32) 



Where <i£ = d£t. This is now i? at all points of space due to an arbitrary wire, and so 
generalizes (25). We can use it to obtain an expression for the force between two arbitrary 
wires, by chopping the second wire into elementary pieces. 



At P2 on wire 2 




so 



dF = I 2 d£ 2 A B 1 



hd£ 2 A B x 



An 



A {dii A (f 2 - n)) 
\r 2 - nl 3 



(33) 
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where fi = belongs to wire 1 and r*2 = ^2(^2) belongs to wire 2. This is the Biot-Savart 

Law for the force between two arbitrary current-carrying wires (more precisely, the force on 
wire 2 done by wire 1). Its interest is largely theoretical, as it is rather hard to use. 

1.3 The story so far 

We've studied time-independent electricity (or electrostatics) and time-independent mag- 
netism (or magnetostatics), and found these to be governed by the following system of 
equations 

V -E=—p V -B = 

eo 

V A £ = VAB = fi J (34) 

with the understanding that 

dE _ dB _ d J _ dp 
~dt ~ ~ ~dt ~ ~di ~ di' 

The electric and magnetic field strengths themselves can be defined from the force laws (7) 
and (24), which we combine as follows: a particle with charge Q moving with velocity v in 
a combination of an electric field E and a magnetic field B is subject to the force given by 

F = Q(E + vAB). 

In addition we have one equation which we expect to be valid also in the time-dependent 
case, namely the charge conservation equation 

^ + V-J = 0. (35) 

The problem now is how to extend all these equations to the time-dependent case, i.e. how 
to put back the terms that are zero in the time-independent case. This will be the subject 
of the second part of this course. 
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2 Time Dependent Electromagnetism: Maxwell's Equa- 
tions 



Experimental evidence suggests two qualitative facts: 

• A time-varying B produces an E. 

• A time-varying E produces a B. 

This experimental evidence can be summarized quantitatively in the following two integral 
laws: 

j) E ■ dl = — — J B ■ dS Faraday's law of induction (36) 

<j>B-dl = J /i J-dS + j e^J^-dS (37) 
Where £ spans Y. Apply Stoke's theorem to both to obtain 



V A B - vof- e /i — I • dS = (39) 



Of 



and if these hold for all £ we obtain the 
Maxwell equations: 



x dB ^ x - 1 dE 

VAE + - =0; =^ J+ c^lH 

V-B =0, V • E = -p (40) 

with c 2 = — — . 

These are the Maxwell's equations, the basic equations of the new theory of electromagnetism, 
in which electricity and magnetism are merged. 

Note that 

• Two of the equations always have zero on the right; the other two contain the sources 
p and J. 
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• If all time-derivatives are set to zero, we recover the equations of the previous section. 

• The system obtained by setting p and J to zero is called source-free Maxwell equations. 

• Regarding p and J as given, there are eight equations for six unknowns, so there should 
be a consistency condition. 

To see what the consistency condition is, calculate: 

dp d .„ „ f dE 



-V • (V A B - fi f 

in V V 



/'0 

= -V-J 

which we recognize as the charge conservation equation. From this point of view the charge 
conservation equation is a consistency condition for the Maxwell's equations. As an exercise, 
show that there is a second consistency condition but that it is automatically satisfied. 

Energy of the Electromagnetic Field 

There is a deductive way to approach this question, but we shall rely on intuition and 
guesswork. We want an energy density (energy-per-unit-volume) which is something like 
n l/2mv- u". 

After playing around with Maxwell equations for a while, we are lead to consider 

£ = U\E\ 2 + -^\B\* (41) 

z Z/i 

Then, using the Maxwell equations, we calculate 



dE e e 9E + 1 B 9B 
dt dt p dt 



E ■ (V A B - pof) - —B ■ V A E 
V / Mn 



Po V / p 

= -V- (—EAB) -E-J 
\t*o J 

For the source-free case, we set J = so that the last term vanishes. Then, this has the 
form of a conservation equations 

BE 

— + V-P = (42) 
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Remember that £ is the energy density. P = —E A B is called the Poynting vector and has 
the interpretation of the rate of energy flow, or momentum density. 

If we now reinstate the sources, this equation has a name: 

Poynting's theorem 

OS -. - - 

We integrate Poynting's theorem over a volume V with surface S to obtain an energy balance 
equation: 



d 
dt 



SdV = - / P ■ dS - / E ■ JdV 
Js Jv 



(1) (2) (3) 

where each term has the following interpretation: 

1. Rate of increase of electromagnetic energy. 

2. Rate of energy flow into V. 

3. Rate of work done by the field on sources. 



To justify the interpretation of (3), consider a single charge. Remember that work is the 
product of force and displacement dW = F-d£, so the rate of work is ^5- = F ■ v. For a single 
charge F = QE and Qv is by definition the current j, so that ^ = E ■ j. This satisfactory 
interpretation of the third term reinforces the interpretation of £ as energy density. 

Potentials 

The introduction of potentials in the time- dependent case is similar to that in the time- 
independent case. Assume we are interested in a suitable (simply-connected, etc) region of 
space, then the Maxwell equations imply, first: 

V ■ B = 3A such that B = V A A ; 
next |f = V A § , so that 

„ * dB „ / dl\ 
VA£+1F = VA^ + -j=0, 

whence 

dA 

3$ such that E + — = -V$ 
at 
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i.e. 



B = V A A (43) 
8 A 

£ = -V$ - — (44) 



The freedom to modify A and $ changes slightly: if A — > A + VC, then B is unchanged, but 
we need $ — >• $ — ^ to leave unchanged. 

This is a gauge transformation and can be exploited, as before, to simplify potentials. 



Electromagnetic Waves 

For the rest of this section, we show that the source-free Maxwell equations admit wave 
solutions, and use these solutions to recover some elementary properties of optics. The 
realization that light is an electromagnetic phenomenon was a triumph of nineteenth century 
science. 

We start from the Source-free Maxwell equations: 

V-B =0, V-E =0 (45) 
a t5 

VA.E+— =0, VA5-^f =0 
we would like to decouple the equations for E and B. Calculate: 



so that 



VA(VAE) = V(V ■ E) — W E = —V E 

„ OB d _ - 1 d 2 E 

= - VA lH=-dt {VAB) = '7^ 

1 d 2 E - 



c 2 dt 2 

which is the wave- equation with c = (eo/io) -1 ^ 2 as the wave speed. 
Similarly (exercise) 



V 2 E (46) 



1 d 2 B 



V 2 B (47) 



c 2 dt 2 

We want to solve these wave equations, so think first about a scalar version 

1 d 2 F „ „ d 2 F d 2 F d 2 F , . 

V 2 F = 1 1 (48) 



c 2 dt 2 dx 2 dy 2 dz 2 
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and try F = f(ut — K ■ x) with constant uj and K. Then 



VF=-Kf 

d 2 F 

^ = c 2 /", V 2 F = \K\*f" 

Hence, F = f(cut — K ■ x) is a solution for any (twice-differentiable) / provided \K\ 2 = 
Note that F is constant on surfaces 

ut — K ■ x = const 

at a fixed t this is the equation of a plane, so these solutions are called planewaves . If / is 
cos or sin, then they are called harmonic waves with a single frequency, or monochromatic 
plane waves. 

Now try 



E — e.\ cos + e*2 sin 0, = cut — K ■ x (49) 
B = bi cos 4> + b 2 sin 0, 

with constant vectors K e.i and Imposing the source- free Maxwell equations in order: 

V • E = — sin 0e*i • V0 + cos <pe 2 ■ V0 
= sin <pK ■ e*i — cos <pK ■ e 2 
= 

so we need 

K-ei = K-e 2 = and then K ■ E = (50) 
In the same way we also have 

V-B = 0^>K-b 1 = K- b 2 = and then K ■ B = (51) 



Next 



so 



z.e. 



K A &i sin <f) — K A e 2 cos — cu&i sin + cu&2 cos = 



u)h = K A e t (52) 
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and so 
Finally 



ouB = K f\E (53) 



so 



or 



-. 1 dE 

VAB -?aT = ° 

^ ^ W ^ OU ^ 

K A 61 sin — A 62 cos H — -e*i sin -e-j cos 1 

c 2 c 2 



^e^-KAb, (54) 
c z 

jJ E = -KNB (55) 



c 2 



From this and (52) we deduce: 



oJ 2 



C 2 



e* = ub; A K 



= — (e*i A K) A K 
= \K\% 

so that |-K"| 2 = ^ as expected and (52) implies ou\bi\ = \ei\\K\, ei-bi = 0, so 



|ej| = c|&i|, ej • 6j = (56) 

These are transverse waves (unlike e.g. sound). The waves travel in the direction of K, while 
E and B lie in wave fronts, transverse to the direction of propagation, and are orthogonal 
to each other. 

B,bi 

direction of wave 
> 




K 

E, e ; 



To simplify the expressions, take K in the ^-direction, K ■ x — Kz. Look in the plane z = 
so that d> = out. Then 



E — e*i cos out + e 2 sin out (57) 

B = —K A e*i cos out + —K A e 2 sin out (58) 
ou ou 
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To discuss the phenomenon of polarisation, we distinguish some particular cases: 
(i) €1,62 proportional, say to i, then 



for some constant 5; 



E = Eicos(ut + 5) 



B = -Ej cos(wt + 5) 



cB 



x 



(59) 
(60) 



a wave like this is said to be lineary polarised; E and B oscillate parallel to two fixed 
orthogonal directions. 

(ii) e*i, e 2 orthogonal, of equal length, e.g. e\ = Ei, e 2 = Ej, then 



E = E(i cos tot + j sin cut) (61) 
■+ E , - 

B = — (— i smut + j cosut); (62) 

a wave like this is said to be circularly polarised; E and B rotate at a constant rate about 
the direction of propagation. 



y 












• 
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The direction is anti-clockwise and the wave is left circularly polarized if e\ A e*2 • K > 0, and 
the rotation is clockwise and the wave is right circularly polarized if e\ A e?2 ■ K < 0. 

(iii) the general case can be regarded as a combination of waves with different polarizations. 
Epilogue 

What comes next in this story? The question how does one transform the Maxwell equations 
between frames of reference which are moving uniformly relative to each other? Evidently, 
waves in one frame should be waves in the other, but the big surprise is the experimental fact 
that the speed of light in vacuum is the same for all observers in uniform relative motion!. 
Since the speed of light is made from constants appearing in Maxwell's equations, this at 
least suggests that Maxwell's equations are the same in all frames. How all this works is the 
province of Special Relativity: first time and space fit together as space-time, then t and 
f fit together so that vectors have four components instead of three, now $ and A form a 
4-vector and so do p and J, while E and B fit together as an anti-symmetric 4x4 matrix. 
See B7.2b for more... 
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Part 2: Quantum Mechanics 

Synopsis 

The mathematical structure of quantum mechanics and the postulates of quantum mechan- 
ics. Commutation relations. Poisson's brackets and Dirac's quantization scheme. Heisen- 
berg's uncertainty principle. Creation and annihilation operators for the harmonic oscillator. 
Measurements and the interpretation of quantum mechanics. Schroedinger's cat. Spin 1/2 
particles. Angular momentum in quantum mechanics. Particle in a central potential. The 
hydrogen atom. 

Reading list 

The main book followed for this part of the course is: 

K. C. Hannabuss, Introduction to Quantum Mechanics (Oxford University Press, 1997). 
Classic texts on Quantum Mechanics: 

L. I. Schiff, Quantum Mechanics (3rd edition, Mc Graw Hill, 1968) 

L D Landau and E.M. Lifshitz, Quantum Mechanics Non-Relativistic Theory: Volume 3. 
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3 The Mathematical Structure of Quantum Mechanics 

(Hannabuss chapter 6) 



3.1 Reviewing Part A Course 

We shall recall what you learned there and add one new idea: 

• a particle is described by its wave function: 

i(j(x,t) = i[)(x,y, z,t) in 3 dim 
= ip(x,t) in 1 dim 

which is complex and continuous (and usually twice-differentiable in x). 

• the function p = \ijj\ 2 is interpreted as the probability density of position provided that 



i.e. provided ip is normalized. (We'll usually omit the range of integration when its 
intended to be all space). For this interpretation to make sense we at least need 



for then a constant multiple of ip is normalized. Recall also the definition of the 

probability current. 



• for a particle of mass m moving in a real potential V(x) ijj changes in time according 
to the Schrodinger equation: 




(63) 




(64) 



(65) 




(66) 



This is motivated by the Hamiltonian from Classical Mechanics 




(67) 



p — > —iKV. 



(68) 
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the Schrodinger equation is linear, so the solutions of (66) form a complex vector space, 
and given a normalizable solution, we can normalize it (multiplying it by an appropriate 
constant) and still have a solution. 

there exist separable solutions of (66): 



iJ>(x,t)=V(x)T{t) (69) 

with 

_ . . iEt , . 

T(t) = e~^r (70) 

and 

= E^> (71) 

which is an eigenvalue equation for the allowed energies. Solutions like (69) are called 
stationary states even though ^ ^ 0, because §f — — J- Correspondingly (71) is 
called the stationary state Schrodinger equation. 

in many examples (e.g. particle in a box, harmonic oscillator,...) (71) has a countably 
infinite, orthonormalizabled set of solutions : n e N} with corresponding real 

eigenvalues E n , so that, without loss of generality 

J ^ = 5 ir (72) 
These provide a basis for the general solution of (66): 

^(x,t) = J2^k(x)e-^ h (73) 

k 

If we know the wave function at the initial time ip(x, 0), then we can determine the 
constants c^. 

c k = J V k {x)iP{x,0) (74) 

Hence, the wave function at initial time determines the wave function at all times 
(as it should, since the Schrodinger equation is a first order equation in t). Now the 
normalization of ip becomes 



NEW IDEA: Now comes the new idea: since they sum to one, it is tempting to think 
of the Cj as the probabilities that the particle is in state j, and we shall. Since the 
state j has energy Ej this implies 

P{E = E 3 ) = \c 3 \\ 
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writing P for probability, and then the average value, or more properly the expectation, 
of the energy is 



E(H) = J>il<* 



(75) 



j 

note that this coincides with the usual definition of expectation 




iE k t/h 



) 



where we have used the definition from the Part A course for the expectation of a 
function f of position (x) in the state ip: 



3.2 States and Observables 

Having finished the review, we start this part of the course by seeking a more abstract setting 
for the above. We suppose that, in quantum mechanics, 

• a system can take up one of a number of states which correspond to elements of a 
complex vector space H, called the Hilbert space, which has a Hermitian inner product 
that we will write (u\v) for u, v e T-L 

• proportional vectors define the same state (so states actually correspond to one-dimensional 
subspaces or rays in %). 

• an element ip e % is normalized if 



(this still leaves a freedom ip — > ipe ta without changing the state.) 

• observables are self-adjoint linear operators A : % — > % (recall the adjoint A* of a 
linear operator A is defined by 




(76) 



|M| 2 = <V#> = 1 



(A*<f>\ip) = (4>\Ai)), for all 0, iff e U 



and A is self-adjoint if A* = A) 
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the mot important example of an H is complex-valued functions ip on M 3 with 

(<f>W) EE / # (77) 

then the position operator X = {Xi, % = 1,2,3} and the momentum operator P = 
{Pi, % = 1,2, 3} are defined (as we know) by 



Xjip = Xjtp 

Ptf = - ih *L (78) 

dXj 

Are these in fact self-adjoint? For Pj we calculate 

on integration by parts, assuming differentiability of <fi, ^ and the vanishing of the 
integrated term, which would follow from normalizability of <f),ip. Thus the Pj are 
self-adjoint, and it is trivial to check that the Xj are. (Note however, that the Pj are 
only defined on a subspace of H (normalizable functions), and that both Xi and Pi 
may spoil normalizability- we will note but pass over this kind of technical issue.) 

3.3 Statistical Issues 

Following section 3.1, we will define the expectation of an observable A in a (normalizable) 

state if) to be 

MA) S <JM (79) 
then e.g. for a function F(X) and a normalized 

E^(F) = (^\F^) = y ^ (80) 
just as before. From the abstract definition (79) we obtain 

1. E^(I) = 1 for the identity operator /; 

2. E^(A) is real for self-adjoint A; 

3. E V ,(A) > for positive A (i.e. A for which (^|A^>) > for all i/j); 

4. E^(aA + = aE^(A) + /3E^(B) for all a, /3 e C 
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All but the second are easy to see; for the second note that 



ty\A1>) = (A^) = = WW) 

for self-adjoint A. 

Next define the dispersion A^(A) of an observable A in a state ip as 

A^(A) = (E^A 2 ) - (E4A)) 2 ) 1/2 (81) 

This is like standard deviation, so it measures the spread of the measurements of A about 
the average value. 

Proposition 

A^j(A) = iff ip is an eigenvector of A. 
Proof 

Suppose ip is normalized, then for any real fi, 

E^(A - /i/) 2 ) = lfj>\{A - /i/) 2 ^) = ((A - vW\(A - »m 
since A — fil is self- adjoint 

= \\(A-^\ | 2 

Also, by linearity 

E 4 ,((A - /i/) 2 ) = E^A 2 ) - 2/jE^A) + /i 2 
so put \i = E^(A) (which is allowed, since this is real) to find 

{A^A)f = \\{A-^\\\ H = U*(A) 

Thus 

Atp(A) = iif (A - iil)ip = iif Ai/j = fii/j 

as required. 

To make contact with earlier formulae, suppose % has a (countable) basis of normalized 
eigenvalues of A, say {ipn}, with 

Aip n = a n ip n , 

then for any normalized ip e H we have an expansion ip = c n ip n with c n = (ip n \ip), so that 

E^A) = (lp\AlP) = ^C n (^\A^ n ) = J2 a nCn(^n) = 

n n n 

and we recover (75). 
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Observables are going to be the things we can measure, and the results of these measurements 
are the eigenvalues of the corresponding linear operator, but we can't assume that every 
observable of interest has normalizable eigenvectors. For example, consider P, then the 
eigenvalue equation is 

Pip = —iftVip = kip 

for a constant vector k, so the eigenfunction is const x e tk ' x f r \ which isn't normalizable. As 
another example, suppose 

Xip = dip 

for eigenvalue a. Then 

(x — a)ip(x) = 

and ip vanishes at every point except x — a. Continuity would then force ip to vanish 
everywhere (though if you are Dirac, you might invent the delta function at this point). 

As an example of dispersion, we revisit the treatment of a particle in a box from Part A. If 
the box occupies < x < a, then the Hamiltonian and normalized eigenfunctions were 

H=—P\ ip n (x) = \-sm 

2m V a a 

Now calculate 

^„(P)= / Mx)(-ihiP' n (x))dx = 
Jo 

writing prime for d/dx, so that 'on average' the particle moves neither left nor right, while 

n 2 TT 2 h 2 



E^„(P 2 ) = Ej, n (2mH) = 2mE n 



,2 



a- 

Thus the dispersion of P is not zero - the particle is jiggling about. 



3.4 Time Evolution 

In this abstract setting we will write ipt to indicate a time-dependent element of "H, then 
the time-evolution, following the Schrodinger equation, is determined by a Hamiltonian H, 
which is an observable, by 

ih 9 ^ = Hip t (82) 
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Formally this can be solved by the expression 3 



ij)t = exp ( - % - [ Hdt ) 



h 



where exp of an operator is defined by 



exp(A) = l, A k 



k=0 

and we will just note that there may be convergence issues. 

To prove (83), suppose for simplicity that H has no t-dependence so that 



/ Hdt = Ht 
Jo 



Then introduce 



so that (83) is just 



Claim: 



Ut^expl — ) (84) 



ilh = Uti>v (85) 



• U = I 

• lt = -jHU t = -fU t H, 

• U s Ut = U s+t so in particular U- t = (ft) -1 . 

• Since H is self-adjoint, 

U? = exp(^) = U- t = (ft)- 1 

and so 6^ is unitary. 

The first is clear; the second follows from differentiating (84); for the third, consider 

w t = u s u t - u s+t , 



3 The Hamiltonian H could depend on time, but for the following expression to be true we will assume 
that Hamiltonians at different times always commute. This will always be true for the examples seen in this 
course. 
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then W = and 



dWt i i 

-W = -H H{U ' U *- U ^ = -l HW < 
this is a first-order linear ODE so has a unique solution, which must therefore be Wt = 0. 
Then the fourth is clear. Now (85) is equivalent to 

U- t iP t = Vo (86) 
and we will prove it in this form. Differentiating (86) we obtain 

d TT d ( ,iHt. , \ ,iHt. ,diht i * 

-C/- t ^ = -jr exp( — )^ = exp(— )(-£ + -/^t) 



<it \ h J h dt h 

which vanishes by (82). So U-tipt is independent of t and we can evaluate it att = 0, obtaining 
(86). 

Thus time-evolution following from the Schrodinger equation (82) is a sequence of unitary 
transformation (85). In particular 



ll^ll 2 = = (UtMUt^o) = (i>o\U t *U t i/; ) = (^ol^o) = HV'oll 2 (87) 

i.e. unitary evolution preserves the norm. 

3.5 Measurements 

When we measure an observable A the result is one of its eigenvalues with some probability. 
If we measure it again at once, then we should get the same answer with probability one. 
Thus the dispersion is zero, and the state just after a measurement must be an eigenstate of A 
with the measured eigenvalue. There may be more than one eigenstate with such eigenvalue, 
so we are led to make Luder's Postulate: 

Suppose a measurement of A in a state ip gives the result a; let Q a be the projection onto the 
subspace of H spanned by the eigenvectors of A with eigenvalues a; then immediately after 
the measurement ip has become Q a ip. 

(Recall that a projection is a linear operator with Q 2 = Q). We say the measurement has 
collapsed the wave function. The norm changes in this collapse so that it is not unitary. 

4 The Commutation Relations 

(Hannabuss chapter 7) 
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4.1 The Commutation Relations 

Recall, in one dimension, the definitions of position and momentum operators as 

Xip = xxjj, Pip = —ihtp' 
then PX-p = —ih(xip)' while XPip = x(-ihip') so that 

(PX - XP)ip = -ihp (88) 
Generally, we define the commutator of observables A, B as 

[A, B] = AB — BA, (89) 

then we have 

[P, X] = -ihl, (90) 
and more generally, in three-dimensions 

[P i ,X j ] = -ih5 ij I, (91) 

For completeness, note also 

[X it Xj] = [Pi,Pj] = (92) 
Commutators have an algebra which includes the following properties: 

1. [A,B] = -[B,A] 

2. [A,BC] = [A,B}C + B[A,C] 

3. linearity in A and B, e.g. [A, aB + /3C] = a[A, B] + /3[A, C] 

4. The Jacobi identity: [A, [B, C}} + [B, [C, A}} + [C, [A, B}} = 0. 

5. For self-adjoint A,B , [A, B] is not self-adjoint, but i[A, B] is. 

I leave the proofs as an exercise! 

4.1.1 Aside: Similarity with Poisson Brackets 

(Read this only if you know what a Poisson Bracket is) 

The commutation relations between Xi and Pi strongly resemble the classical Poisson Bracket 
relations for generalized coordinates and momenta. Recalling that the Poisson bracket is 
defined by 
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we have 

{pi,x j } = 5 ij , {xi,Xj} = {pi,pj} = (94) 

Furthermore, the above properties of the commutator also paralell those of the Poisson 
bracket. 

The similarity with Poisson brackets led Dirac to suggest that each function classical / (for 
instance, the coordinate x or the momentum p) should be related in quantum theory by an 
operator Q(f), in such a way that for any pair of functions /, g the corresponding operators 
satisfy 

[Q(f),Q(g)] = -ifiQ({f,g}) (95) 

This idea doesn't work simultaneously for all operators, and sometimes one faces ambiguities. 
For instance, to the classical function px 2 we could assign PX 2 or X 2 P or PXP, and so on. 
These differ from each other, since P and X do not commute, but their difference is "small", 
namely, proportional to U. 

4.2 Heisenberg's Uncertainty Principle 

We will start with a 
Lemma 

Suppose A, B, C are self-adjoint with 

[A, B] = iC 

and n is real. Then 

1. (A - t/2B)*(A - ifiB) = A 2 + fiC + n 2 B 2 ; 

2. for normalized ip e % 

1 1 (A - ifiB)iP\\ 2 = E^A 2 ) + fjE^C) + /i%(5 2 ); 

3. for any normalizable if) e H 

E4A 2 )E4B 2 ) > ^(E^(C)) 2 (96) 
with equality iif (A — ijiB)i[) = for some real \i. 
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Proof 

The first one is obvious (just multiply); for the second 



\\(A - iiiB)^\\ 2 = ((A - ifiB)t/j\(A - ifiB)tfj) = {tfj\(A - i(iB)*(A - ifiB)ip), 

and use the first; now the LHS is non- negative so that the discriminant of the RHS as a 
quadratic in \i must be non-positive, which gives the third. 

Corollary: Heisenberg's Uncertainty Principle 

For normalized if), 



A4P)A4X) > l -h (97) 



with equality iff 



ip = const, x exp ( — 7^( x ~~ a ) 2 ) (98) 
for some real \x and complex a. 
Proof 

Set A = P — E^(P)I, B = X — E^(X)J so these are self-adjoint and 

[A, B] = [P, X] = -ihl 

so take C = —hi. Note that 

E^A 2 ) = (A^(P)) 2 , E V ,(P 2 ) = (A^(X)) 2 , E V ,(C) = -h, 
then the result follows from (96). For equality we need with 

(A - ifiB)ij; = 0. 
Introduce j3 = E^,(P) — i/uM^(X), then 

(A - iuB)ip = i-ih-f- - iux - P)ip = 
ax 

which is a separable ODE leading to 

logip = — T^O^ 2 - 2 — 2if3x + const.) 

which is (98) with a = 

We shall call a state with wave-function like (98) a minimum uncertainty state. You saw an 
example in Part A, the ground state of the harmonic oscillator which had 

2 

TflbJX 

*&o(x) = const, x exp( — — ) 
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4.3 Simultaneous Measurability 



Why is (97) called the Heisenberg Uncertainty principle? It implies that a reduction in the 
dispertion of P, A^(P), must be accompanied by an increase in the dispersion of X and 
vice versa - there is a fundamental limit on reducing the common uncertainty. Thus P and 
X cannot be simultaneously be measured to arbitrarily high accuracy, and the origin of this 
is in the nonvanishing of the commutator. The same will hold for any pair of observables 
which don't commute. 

What if [A, B] = 0? For a finite-dimensional % there will then be a basis of common 
eigenvectors, and on a common eigenvector A and B can be simultaneously measured exactly, 
i.e. the dispersions vanish. For an infinite-dimensional % one can do less. If A, B are self- 
adjoint introduce: 

%a = span of eigenvectors of A. 

He = span of eigenvectors of B. 

Hab — span of common eigenvectors. 

Then Claim: If AB = BA then H AB = U A ^\U B 

Then we can say the following: 

Corollary 

If Ti has an orthogonal basis of eigenvectors of A (so %a = H) and [A, B] = then Hab = / Hb 
so that A, B can be simultaneously diagonalized on Kg. 

We shall see below, in section 5, some important application of this observation. There we 
shall have operators which commute with the Hamiltonian, so that energy eigenstates (i.e. 
eigenstates of the Hamiltonian) can simultaneously be eigenstates of these other operators. 
A result which can usefully be given here concerns operators which do not commute with 
the Hamiltonian. We have 

Proposition 

For any observable A write (A) = (ijj\Aijj) then 



This follows rapidly from (66) by differentiating (ip\Aijj). For an A with non explicit time- 
dependence the first term is zero, so that the rate of change of (A) is determined by the 
commutator [H, A]. 

4.4 The Harmonic Oscillator Revisited 

Now we come to the pay-off for this abstraction. Recall that the harmonic oscillator has 
Hamiltonian 



d 
dl 



r) A i 
W = (^) + -A[H,A}) 



(99) 



1 




(100) 



H 



2m 
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Inspired by section 4.2 above, define 



a_ = P - imuoX (101) 
This operator is not self-adjoint and in fact its adjoint is 

a+ = ( a _)* = P + imuX (102) 
These operators have the following important properties: 
Lemma 



1. 



[a_, a + ] = 2mhul 
a + a_ = 2m(H — -tvjjl) 

a_a + = 2m(H H — hwl) 



\\a_^\\ 2 = 2m(E lp (H)-^hu J )\\ij\\ 2 
||a + ^|| 2 = 2m(E^(^) + ^)ll^ir 



[H, a_] = —ftwa_ 
[H, a + ] = hwa + 

Proof: 1 is straightforward; for 2 write (a_ip\a_il)) = (ip\a + a_ip) and use 1; for 3 use 1. 

Proposition: The Harmonic Oscillator done by Algebra 

Suppose H from (100) has a normalized eigenvector ip e % then 

1. there is a normalizable ground state tpo with energy eigenvalue E = \hw\ 

2. there are normalizable excited states ip n = (a+) n tpo for each positive integer n, with 
energy eigenvalue E n = (n + \)hw; 

3. if we take P = —ifrS. and X = x then ip = const. exp(— m ^-) and the ip n , n — 0, 1, 2, ... 
are all the eigenstates. 
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Proof 

Suppose the given normalized eigenvector satisfies 



Hip = Eip 

then 

Ha.ip = (Ha_ - a_H + a_H)ip = (a_H + [H, aJ/)ip = (E - hw)a„ip 

by part 3 of the Lemma. That means that a-ip is also an eigenvector of H, but with 
eigenvalue (E — hid), i.e. a_ "lowered" the eigenvalue by hid. Also 

||a_^|| 2 = 2m(E4H) - ^hd)\\iP\\ 2 = 2m{E - ^MIMT 
by part 2 of the lemma, so that this new eigenvector is normalizable. 

We can then apply a_ again. We repeat this process obtaining normalizable eigenstates 
(aJ) k ip with eigenvalue E k = E — khw. However, this process must stop as 

\\{a_) k 4j\\ 2 oc {E - [k - l/2)huj), 

which cannot be negative. Thus the process will stop when we reach ip with E = E = ^t\w 
and 

||a_^o|| 2 = 0, i.e. a-ipo = 

so that further lowering gives zero. (At this point we don't know the dimension of her a_, 
but we do know that any element of it has E = E = hfuJ). Now we start to 'raise': set 
ipi = a + ip then 

Hipi = (Ha + — a + H + a+H)ipQ = (hcu + -ftw)a + ipo = f\u(l + l/2)ipi 

Inductively, raising will give normalizable states ip n = a + ip n -i with eigenvalues E n = (n + 
1/2) fad. This process can be repeated indefinitely and we have a chain of eigenstates, labelled 
by the positive integers, arising from each candidate ipo. Every eigenstate must be in one of 
these chains, since the only elements of ker a_ are lowest-energy states so that a + and a_ 
are one-to-one going up and down the chain. 

As regards ipo if we make the usual identification P = —ih-^ and X — x, then we have 

= a^ipo = (P — imujX)ipQ =^> i—ih- imcdx)ipo = 

dx 

from which 

/ mux 2 

■ipo = const, x exp 

\ 2h 

This gives a unique normalized ground state ipo, i.e. in this case dim( ker a_) = 1, and 
every other state is obtained by raising this one. 
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4.5 Uniqueness of the Commutation Relations 



This is a technical aside: the algebraic approach of the previous subsection started just from 
the commutation relations 



[P, X] = -ih 

We only used the identification P = —ihjj^ and X = x to find ipo(x) and hence %li n (x). 

What freedom do we have in making this identification for P and XI If we had instead 
proceeded by assuming that fcer(a_) was one-dimensional, spanned by fl say, then the nor- 
malizable eigenstates would be ip^ = («+) n ^- Now we would have the Hilbert space % 
spanned by these and there would be an isomorphism of Hilbert spaces defined by relating 
the bases: 

Under this isomorphism the abstract P, X acting on would map to the operators —ih4^, x 
acting on our previous Hilbert space of functions. Thus the identification P = —ifr4z and 
X = x is unique up to isomorphism. 

5 Angular Momentum 

(Hannabus chapter 8) 

5.1 Angular Momentum in Quantum Mechanics 

Motivated by the definition of angular momentum from classical mechanics: 

L — x A mv = x A p (103) 
we define in quantum mechanics the observables 



In 


— -^2-^3 


— X3P2 




= X 3 P 1 


-X 1 P 3 


u 


= X 1 P 2 


-x 2 p 1 



(104) 

We call this L orbital angular momentum. More briefly 

U = € ljk XjPk (105) 
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using the Einstein summation convention (repeated indices are automatically summed over 
their whole range) and in terms of the alternating symbol defined by 



6ijk = 1 if ijk is an even permutation of 123 
= — 1 if ijk is an odd permutation of 123 
= otherwise 

(so in particular (a A b)i = €ij k ajb k ) 
Properties 

1. [Li, Pj] = iheijkPk', 

2. [Li, Xj] = iheijuXk] 

3. [Li, Lj] = iheijkLk 

Proof 

We'll just do the first of these. One can do them in an unsophisticated way, calculating e.g. 
[Li, P 2 ] and appealing to symmetry or in a more sophisticated way as follows: 

[Li, Pj\ [^imnX m P n , Pj\ Gi mn [X m , Pj^P n ih£i mn S m jP n iTi€ij n P n 

as required. This needs some familiarity with the Einstein summation convention, a skill 
worth adquiring. 

We shall call any set J = (Ji, J 2 , J3) of observables which satisfy 

[Ji, Jj] = ihe ijk J k (106) 
an angular momentum operator, and we will study them in the abstract. 
More Properties 

If [Ji,Aj] = ihe ijk A k and [Ji,Bj] = ihe ijk B k then 

[Ji,A-B] = (107) 

Proof 

[Ji,A-B] = [J h AjBj] = [J u Aj]Bj + Aj[Ji, B \ = iht ijk (A k B 3 + AjB k ) = 

where the last step follows because e^fe changes sign under switching a pair of indices and 
(A k Bj + AjB k ) does not. 

We deduce in particular that P-P,P-X,X-X,LL and similar expressions all commute 
with L. 
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5.2 Ladder Operators 

We have just seen that 



[J\ Ji] = (108) 
writing J 2 for J ■ J. We define the ladder operators 

J± = Ji±iJ 2 (109) 
and note that (J+)* = J- assuming the Ji are self-adjoint. 
Properties 

1. [J 2 ,J±] = o, 

2. J+ J_ = J 2 - J 2 + hJ 3 , 

t/_ t/_|_ = t/ t/g /it/ 3 , 

[J+, J_] = 2/lJ 3 , 

3. [J 3 ,J ± ] = ±£J± 

1 is done and 2 and 3 follow rapidly from the definitions. 

Now, much as with the algebraic treatment of the Harmonic oscillator, we will find eigen- 
vectors of J 2 and J3. They commute so we will be looking for simultaneous eigenvectors. 
Assume these exist in some "H, so there is a ip with 

J 2 ip = Xh 2 ip, J 3 ip = mhip, 
for real A, m. Consider the new states 

= J+ip, ip- = J-i/j, 

then 

j 2 ^_ = J 2 J-i> = (J 2 J- - J~J 2 + J^J 2 )^ = \h 2 ^_ (110) 

i.e. since [J 2 , J_] = 0, is an eigenvector of J 2 with the same eigenvalue. This is true for 
ip + too. 

Next 

J 3 ^- = J*J-il> = (J3J- ~ J-J3 + J-Jsty = {m- 1)7^- (HI) 
so the J3— eigenvalue is lowered by one for ip_ (and raised by one for 
Next calculate 

||^_|| 2 = (J-ip\J-ip) = (ip\J+J-ip) = (A - m 2 + m)h 2 \\il)\\ 2 (112) 
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which must be non-negative, as 

||^ + || 2 = MM) = (i>\J~J + i>) = (\-m 2 -m)h 2 m 2 (113) 

which also must be non negative. 

Putting these last two results together, we have 

A > mim — 1) and equality iff J-tf; = (114) 
A > mim + 1) and equality iff J + ip = 

This is enough to obtain the results below. 

5.3 Representation of the Angular Momentum 

Here 'representations' means as operators acting on some explicit vector space, so we suppose 
we have a finite-dimensional, complex vector space % of eigenvectors of J 2 : 

J 2 i/j = \h 2 i/j for all if) e H. 
J 3 will have an eigenvector in % say ip with 

J^ip = mhip 

We can obtain more eigenvectors of J3 by raising and lowering with J±, when m will go up 
and down in integer steps. This can't be repeated indefinitely, since (114) implies 

AH — >m 2 ±m+- = (m± -) 2 
4 4 2 

so that \m\ can't be arbitrarily large. Thus there will be a maximum m, call it j with 
corresponding ipj, and further raising must give zero: 

J+lpj = 

By (113), if the LHS vanishes, we find 

A=j(j + 1) 

We lower ifjj in integer steps and this will stop at a value of m at which further lowering 
gives zero. By (112) this will happen when 

= A — m 2 + m = j(j + 1) — m 2 + m = (j + m)(j — m + 1) 
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i.e. at m — —j since m = j + 1 is not allowed (m must be less than its maximum value 
which was j) 

Thus we lower in integer steps from m = j to m = — j, which requires 2j to be a non- negative 
integer. 

Summarizing: 

• the eigenvalues of J 2 are j(j + l)h 2 where j = 0, |, 1, §, 2, 

• for each fixed j, the eigenvalues of J3 are mh with m = — j, — j + 1, j — 1, j; 

• assuming no degeneracies, the eigenspace on which J 2 = j(j + l)h has dimension 2j + 1. 

• we can choose an orthonormal basis {ip m } whith 

ci = [(jTm)(j±m+l)] 1/2 
We have proved all of this except the last part which comes from (113) and (112). 
Important example: j = \ (also called the spin representation) 

Now 2j + 1 = 2 so % is 2-dimensional. The allowed values of m are m = ±| so write if}± for 
-0±i/2, then 

J 3 ^± = ±ifc0± (115) 

and 

J + ^ + = J_^_ = 0, J + ^_ = /ty> + , J_?/; + = fa/*- (116) 
We can introduce a matrix formalism by taking as basis 

^ = ( I ) ' ^ + = ( 1 ) 
Then (115) and (116) fix the form of the operators to be 

h = \ h {\ -l)' J+ = h {l J)' J - = h (°i 0) 

so also 

ji= H° 0)' j2= H* 7) 

and one may check all the commutation relations. These formulae bring to our attention the 
Pauli spin matrices: 
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ai= (? J)' a2= (-z o)' a3= (o -l) (11T) 
which are Hermitian, trace zero and define the Jj for j = 1/2 via 

Ji = \ha t (118) 
from which many algebraic relations follow. 

5.4 Orbital Angular Momentum and Spherical Harmonics 

Let us focus in the particular case of orbital angular momentum J = L = X A P. We seek 
a space of functions for L to act on. As we will shortly see, we need j to be an integer. To 
justify this claim recall 

L 3 = X X P 2 - X 2 P, = -ih(x— - y—) = -ih— (119) 
in terms of spherical polar coordinates. Now the eigenvalue equation becomes 

L 3 i/j = -ih 9 ^ = mhifj (120) 
ocp 

so that 

i) = F{r,B)e im<i> (121) 

Now the important point: if this is genuinely a function, i.e. single-valued, then we need m 
to be an integer, so j is also an integer. (We also see that j = 1/2 must be associated with 
something, not a function, which changes sign under rotation by 2ir.) To pursue this line 
further, lets calculate the other operators in spherical polar coordinates 



L - = -^{m-' mt0 T4) (122) 



d9 2 36 sin 2 fl<90 2 _ 

We will find the eigenfunctions explicitly, so write ip^ m for the one with 

L 2 ipe m = £{£ + l)h 2 tpem, L 3 i[)i m = mhip em (123) 
then the highest m-value is £ and ipu satisfies 
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Ls^ee = ~ ifi ~Q^ = Ztyu, L + i) U = (124) 

The first is solved by ipee — F(9)e l£( ^, where we are only looking at the 9, <fi dependence (the 
r-dependence will be reinstated in the next section), and then the second is 



L^ u = he 1 * - t{cot 9)F^J e il * = 



whence F = const, x (sm9) e , so 

4>u = const, x (sinfl)V^. (125) 

The constant can be fixed by normalization. Once we have this we obtain the other ip£ m by 
lowering with L_. We shall write Ye m (9, 0) for the corresponding functions. Since they arose 
as eigenfunctions we know at once that 

2tt pit 



(ipe imi \ipe 2m2 } = / Y eimi Yi 2m2 sin9d9d(f) = 0, if (4, mi) ^ (£ 2 , m 2 ). (126) 
Jo Jo 

5.5 The Hydrogen Atom Again 

With a better understanding of angular momentum we can do a more thorough treatment 
of the hydrogen atom. The time-independent Schrodinger equation for a particle of mass m 
and charge — e (the electron) in the electric field of a fixed point of charge Ze (the nucleus, 
so Z = 1 for the hydrogen atom) is 

H 2 Ze 2 , , 

V 2 ^ * = 127 

2m Aireor 

and the Laplacian in spherical coordinates is 



(¥_ 2d_ \_f&_ cote IL 1 & 

Q r 2 r Q r r 2 \ QQ2 QQ g j n 2 Q QX2 I 



From (122), we recognize that the angular derivatives are proportional to L 2 , so that we can 
separate \& = F(r)Ye m (9, 0). We then obtain 

^=(l(rFy- i ^ F y em (129) 

writing prime for d/dr. 

We simplify the Schrodinger equation by renaming quantities: 

h 2 K 2 n 2m Ze 2 . . 

£ = " = ^w < 130 > 
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Further, we write F(r) in terms of a new function: 



F(r) = f(r)e- Kr (131) 
We find that / satisfies the following equation 

/"+f 2 --V-(^ + ^V-0 (132) 



r 



We seek a series solution 



f = J2a n r n+c , withao^O, (133) 



n=0 



and substitute into (132) to obtain 

+ c)(n + c- l)r n+c ~ 2 + - 2^ a n (n + c)^" 1 - + ^J^) a n r n+ ^ = 

(134) 

the lowest power is r c 2 and the coefficient is 

a Q {c{c - I) + 2c -£{£+!)) 
This must vanish and ^ so we obtain the indicial equation: 

(c(c - 1) + 2c - ^ + 1)) = (c - £)(c + £ + 1) = (135) 
The root c = —(£ + 1) is not allowed as this would give unbounded at r = 0, thus c — £. 
For the general power r n+c ~ l we obtain the recurrence relation: 

a n +i((n + c+l){n + c) + 2(n + c+ !)-£{£ + 1)) = a n (2re(n + c) + 2k - /3), (136) 

which determines all terms in the series from do- If the series continues to infinity, then for 
large n 

a n+ i 2k 
a n n 

and so 

/ « e 2Kr 

Tracking back through (131), we see that this will give a non-normalizable So normal- 
izability requires the series to terminate and from the recurrence relation there must be a 
non-negative n' for which the RHS vanishes, i.e. 

2«(n' + £ + 1) = P (137) 
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where we have also substituted for c. 

Set n = n' + £ + 1, which is a positive integer, then the series for / takes the form 



/ = /« = /^a fc r fc , (138) 

k=0 

which is a polynomial of degree n — 1, with corresponding 



k = K n = (139) 



The wave function is 



^ nlm = const, x f n {r)e- Knr Y em {8, 0) (140) 
and the energy eigenvalue is 

E = En = .*A = ^L 2 (141) 

2m 8m n 2 

The integers n, £, m, which label the state, are known respectively as the principal quantum 
number, the azimuthal quantum number and the magnetic quantum number. The energy 
eigenvalue notices only n, not i or m. Any spherically symmetric potential will ignore m, 
which only comes into E when spherical symmetry is given up, for example by putting the 
atom in a magnetic field (whence the name), but spherical potentials other than the pure 
Coulomb (i.e. V ~ -) will have i appearing in E. 

We can count the degeneracy of an energy eigenvalue as follows: given n = n' + i + 1 we 
have < i < n — 1 and for each I there are 21 + 1 choices for m, so that the degeneracy is 

n-l 

^(2£ + l)=n 2 

6 A Charged Particle in a Magnetic Field 

(this section is not part of the course and so not examinable, but it makes a link back to the 
material on electromagnetism, and introduces some interesting new ideas.) 

To connect the second half of the course with the first, we briefly consider the quantum 
version of a classical charged particle of mass m and charge e moving in a electromagnetic 
field. In the case of an electric field, with electric potential say $, we know how to do this: 
simply consider the Hamiltonian 

H = — \P\ 2 + V(X) 
2m 1 1 v ; 

with V(X) = e$(X). The hydrogen atom is just like this. 
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In the case of a magnetic field, recall that the force does not work on the particle, so does not 
contribute a potential energy term: something new is needed. To motivate what this is, and 
because it gives some insight into how classical mechanics can be recovered from quantum 
mechanics, look at (99): 

A f)A j 

With the Hamiltonian above, this will give 

^x^^x^^P^x^Lp, 

and 

|<P )= °<p,P ])= ° [n x),P H _<w> 

Combining these two we obtain 

m^(X) = -(VV) (143) 

which closely parallels the classical equations of motion for a particle in a potential, which 
read 



m— f=-W (144) 

(JjV 

In other words, the expectation values satisfy the classical equations of motion. Now we 
consider the Hamiltonian 

H = ^ ((Pi + \eBYf + (P 2 - \eBXf + P 3 2 ) (145) 

which is similar to one introduced on one of the questions sheets, where we found the energy 
levels without the term P 3 2 . What classical problem does this corresponds to? We find 

| W = i(P,X]) = I(P 1 + ie B y). W (146) 
where the right-hand-side serves to define the "velocity" V\. Similarly 

s (y >4<p- y ]>=>4 Bx >^> < i47 > 

while 



d 

7ft 



(Z) = -([H,Z]) = -(P 3 ) = (V 3 ) 



h 



148] 



m 



Then 
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£ W = '<[ir,iy> = i e B W 



so that 



d 2 

m— (Y) = -eB(Vx) (149) 



dt 



■2 



This system corresponds to the classical equation 



m-—x = evAB (150) 

with £> = which is the equation of motion of a classical particle of mass m and charge 
e moving in the magnetic field B, which is uniform and in the ^-direction 

B=(0,0,B) 

To see how to get (145) note that this magnetic field has vector potential 

A=±B(-y,x t 0) (151) 



so that (145) may be obtained by starting with the free particle Hamiltonian H — J^\P 
and making the replacement 
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P^P-eA (152) 

It is a straightforward calculation to see that this is the correct prescription more generally 
so that one could, for example, go on to calculate the energy levels of a hydrogen atom in a 
uniform magnetic field. We won't do that, but to end we will note a new interpretation to 
gauge transformations that this gives. Recall a gauge transformation was a change in A: 

A^ A = A + V( (153) 
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which leaves the magnetic field unchanged. This should therefore lead to no physical change, 
but it will change the term 5— \P — eA\ 2 in H. The resolution of this puzzle is to make a 
simultaneous change in the wave function: 

ij) —)■ ip = ^ Qxp(ie( / h) (154) 

This change doesn't affect the probability density p or probability current j, which are the 
physical observable quantities, and also 

|P - e A\ 2 i) = exp(ie(/h)\P - eA\ 2 *l) (155) 

so that the term exp(ie(/h) factors out of the Schrodinger equation, and solutions will 
transform to solutions. 
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